A PRIME NUMBER THEOREM FOR THE MAJORITY FUNCTION



JEAN BOURGAIN 

Abstract. In this paper, the occurrence of zeros and ones in the
binary expansion of the primes is studied. In particular, the statement
in the title is established. The proof is unconditional.



1. Introduction 

Let $N = 2^n$ and identify $\{0, 1, \ldots, N-1\}$ with $\{0,1\}^n$ by binary
expansion

$$x = \sum_{0 \le j < n} x_j 2^j \quad \text{with } x_j = 0, 1.$$

Assuming $n$ odd, denote by $f : \{0,1\}^n \to \{0,1\}$ the majority function.
Motivated by a question of G. Kalai [Ka], we prove that $f$ does not
correlate with the primes, i.e.

Theorem 1. Let $\Lambda$ be the Von Mangoldt function. Then

$$\sum_{1 \le x < N} \Lambda(x) f(x) \sim \frac{N}{2}. \qquad (1.1)$$
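
The statement (1.1) can be explored numerically for small odd $n$. The following is only an illustrative sketch: the helper names `von_mangoldt`, `majority` and `majority_sum` are hypothetical and not taken from the paper, and trial division is used purely for simplicity.

```python
import math

def von_mangoldt(x):
    """Return log p if x = p^k for a prime p and k >= 1, else 0 (hypothetical helper)."""
    if x < 2:
        return 0.0
    for p in range(2, math.isqrt(x) + 1):
        if x % p == 0:
            # p is the smallest prime factor; x is a prime power iff nothing else remains
            while x % p == 0:
                x //= p
            return math.log(p) if x == 1 else 0.0
    return math.log(x)  # no divisor up to sqrt(x): x itself is prime

def majority(x, n):
    """Majority of the n binary digits of x (n odd): 1 if ones outnumber zeros."""
    return 1 if 2 * bin(x).count("1") > n else 0

def majority_sum(n):
    """Left-hand side of (1.1): sum of Lambda(x) f(x) over 1 <= x < 2**n."""
    return sum(von_mangoldt(x) * majority(x, n) for x in range(1, 2 ** n))

if __name__ == "__main__":
    # Print the normalized sum, to be compared with the predicted limit 1/2.
    for n in (5, 9, 13, 17):
        print(n, majority_sum(n) / 2 ** n)
```

For such small $n$ the ratio need not be close to $\frac12$ yet; the sketch only makes the objects in Theorem 1 concrete.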

Note that the majority function is a monotone Boolean function and
it was proven in [B3] that the Moebius function does not correlate
with any monotone Boolean function. The proof of this property uses
the concentration of the Fourier-Walsh spectrum of monotone Boolean
functions on 'low levels'. More precisely, expanding

$$f(x) = \sum_{S \subset \{0,\ldots,n-1\}} \hat f(S)\, w_S(x) \qquad (1.2)$$

with

$$w_S(x) = \prod_{j \in S} \varepsilon_j, \qquad \varepsilon_j = 1 - 2x_j,$$

the Walsh system on $\{0,1\}^n$, one exploits that

$$\sum_{|S| > n^{\frac12+\varepsilon}} |\hat f(S)|^2$$

is small for monotone Boolean functions. This concentration is not
sufficiently strong however to treat $\Lambda$ instead of $\mu$.

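Recall that for the majority function, by symmetry, $\hat f(S) = \hat f(|S|)$,
which obey

$$|\hat f(k)|^2 \sim \binom{n}{k}^{-1} k^{-3/2} \quad \text{for } k > 0. \qquad (1.3)$$

Hence

$$\sum_{|S|=k} |\hat f(S)|^2 \sim k^{-3/2} \qquad (1.4)$$

and

$$\sum_{|S|>k} |\hat f(S)|^2 \lesssim k^{-1/2}. \qquad (1.5)$$

Write

$$\sum_{x=1}^{N} \Lambda(x) f(x) = \frac12 \Big( \sum_{x=1}^{N} \Lambda(x) \Big) + N \sum_{0<|S|\le n} \hat\Lambda(S) \hat f(S). \qquad (1.6)$$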
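
The level weights in (1.3)-(1.5) can be computed exactly for small odd $n$. The sketch below assumes the normalization $\hat f(S) = 2^{-n}\sum_x f(x) w_S(x)$ and uses the fact that, for a symmetric $f$, the sum of $w_S$ over all $x$ of a given digit sum depends only on $|S|$; the function names are hypothetical. Note that for the majority function the even levels $k > 0$ vanish, so the comparison with $k^{-3/2}$ is meaningful for odd $k$.

```python
from fractions import Fraction
from math import comb

def majority_coefficient(n, k):
    """Exact Walsh coefficient of the majority function (n odd) for any S with |S| = k."""
    total = 0
    for t in range(n // 2 + 1, n + 1):  # digit sums t on which f = 1
        # Sum of w_S(x) over all x with digit sum t: choose i ones inside S
        # (each contributes a factor -1) and t - i ones outside S.
        total += sum((-1) ** i * comb(k, i) * comb(n - k, t - i)
                     for i in range(0, min(k, t) + 1))
    return Fraction(total, 2 ** n)

def level_weight(n, k):
    """Sum of |f^(S)|^2 over all S with |S| = k, the quantity in (1.4)."""
    return comb(n, k) * majority_coefficient(n, k) ** 2

if __name__ == "__main__":
    n = 101
    for k in (1, 3, 5, 9, 17, 33):
        # If (1.4) holds, k^{3/2} times the level weight should stay bounded.
        print(k, float(level_weight(n, k)) * k ** 1.5)
```

By Parseval the level weights sum to $\frac12$, which gives a quick consistency check on the computation.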

Introducing some cutoff $n_0 < n$, estimate the second term of (1.6) by

$$N \sum_{0<|S|<n_0} |\hat\Lambda(S)|\,|\hat f(S)| \qquad (1.7)$$

$$+ \; N \sum_{n_0 \le |S| \le n} |\hat\Lambda(S)|\,|\hat f(S)|. \qquad (1.8)$$

Because primes are odd (except for the prime 2), for $S = (1, 0, \ldots, 0)$,

$$\hat\Lambda(S) = \frac1N \sum_{x=1}^{N} \Lambda(x)(1 - 2x_0) \approx -\frac1N \sum_{x=1}^{N} \Lambda(x) \approx -1.$$

For $0 < |S| < \sqrt{n}$, $S \ne (1, 0, \ldots, 0)$, it follows from [B2] that

\A(S)\ < e~ c ^. (1.9) 

Taking 

n ~n5- £ (1.10) 
the preceding permits us to bound (1.7) by




(1.11) 



On the other hand, if we try to estimate (1.8) using the $L^2$-norm, the tail
estimate (1.5) implies

$$(1.8) < N \sqrt{n} \Big( \sum_{|S|>n_0} |\hat f(S)|^2 \Big)^{1/2} < N \sqrt{n}\, n_0^{-1/4} \qquad (1.12)$$

which is not conclusive, no matter how $n_0 < n$ is chosen.

Hence a more refined analysis is needed, involving more than just
the low Fourier-Walsh spectrum of $\Lambda$.

In what follows, we will rely in particular on estimates related to
those in the work of Mauduit-Rivat [M-R], where it was shown that $\Lambda$
does not correlate with the parity function

$$\sigma(x) = e^{i\pi \sum_{0 \le j < n} x_j} = w_{\{0,1,\ldots,n-1\}}(x) \qquad (1.13)$$

(rather than the majority function). See also [B1], from which we will
borrow certain estimates.

Before going further, we point out the following easy consequence of
[B2] on prescribing binary digits of the primes.

Theorem 2. Let $\rho < \frac47$. Then, with the above notations, taking $r \sim n^{\rho}$,
there are at least $O(2^{-r} \frac{N}{n})$ primes less than $N$ satisfying

$$\sum_j x_j > \frac{n}{2} + \frac{r}{3} \qquad (1.14)$$

and at least $O(2^{-r} \frac{N}{n})$ primes for which

$$\sum_j x_j < \frac{n}{2} - \frac{r}{3}.$$



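It follows indeed from [B2] that for $r < n^{\frac47 - \varepsilon}$, the set

$$\Omega_1 = \{p < N;\ x_0 = x_1 = \cdots = x_{r-1} = 1\}$$

satisfies

$$|\Omega_1| \sim \frac{N}{n} 2^{-r}.$$

Since also for $1 \le A < \log n$

$$\Big| \Big\{ x < N;\ x_0 = \cdots = x_{r-1} = 1 \text{ and } \Big| \sum_{j=r+1}^{n-1} x_j - \frac{n-r}{2} \Big| > A \sqrt{n-r} \Big\} \Big| < e^{-cA^2} N 2^{-r},$$

necessarily most elements of $\Omega_1$ will satisfy

$$\Big| \sum_{j=r+1}^{n-1} x_j - \frac{n-r}{2} \Big| < O(\sqrt{n \log n})$$

and

$$\sum_{i=0}^{n-1} x_i > \frac{n+r}{2} - O(\sqrt{n \log n}).$$

The second part of the statement is proven similarly, considering the
set

$$\Omega_0 = \{p < N;\ x_1 = \cdots = x_{r-1} = 0\}.$$

Note that it is essential for this argument that $r \gg n^{\frac12}$.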
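
The two counts in Theorem 2 can be tabulated directly for small $n$. The sketch below is a brute-force illustration only (it says nothing about the asymptotic regime $r \sim n^{\rho}$); the function names are hypothetical, and the thresholds $\frac n2 \pm \frac r3$ are those of (1.14), tested in integer arithmetic.

```python
import math

def primes_below(N):
    """Sieve of Eratosthenes: all primes p < N (N >= 2)."""
    sieve = bytearray([1]) * N
    sieve[0:2] = b"\x00\x00"
    for p in range(2, math.isqrt(N) + 1):
        if sieve[p]:
            # Strike out the multiples of p starting at p*p.
            sieve[p * p::p] = bytearray(len(range(p * p, N, p)))
    return [p for p in range(N) if sieve[p]]

def count_digit_sum_deviations(n, r):
    """Return (#primes p < 2**n with digit sum > n/2 + r/3,
               #primes p < 2**n with digit sum < n/2 - r/3)."""
    high = low = 0
    for p in primes_below(2 ** n):
        s = bin(p).count("1")
        if 6 * s > 3 * n + 2 * r:      # s > n/2 + r/3
            high += 1
        elif 6 * s < 3 * n - 2 * r:    # s < n/2 - r/3
            low += 1
    return high, low

if __name__ == "__main__":
    n = 20
    for r in (0, 3, 6, 9):
        print(r, count_digit_sum_deviations(n, r))
```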
2. Symmetrization of the Von Mangoldt function

Returning to the proof of Theorem 1, we note that

$$\sum_{x=1}^{N} \Lambda(x) f(x) = \langle \Lambda, f \rangle = \langle \Lambda_s, f \rangle$$

where $\Lambda_s$ stands for the symmetrization of $\Lambda$ under the permutation
group of $\{0, 1, \ldots, n-1\}$. Thus

$$\Lambda_s = \sum_{k=1}^{n} \frac{\sum_{x \in \Omega_k} \Lambda(x)}{|\Omega_k|}\, 1_{\Omega_k} \qquad (2.1)$$

where $\Omega_k = \{x \in \{0,1\}^n;\ \sum_j x_j = k\}$.
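
The symmetrization (2.1) amounts to averaging over each level set $\Omega_k$. A minimal sketch, with hypothetical helper names, checks on a small example that the pairing with a symmetric function such as the majority function is unchanged while the $L^2$-norm does not increase:

```python
import math

def von_mangoldt(x):
    """Return log p if x = p^k for a prime p and k >= 1, else 0 (hypothetical helper)."""
    if x < 2:
        return 0.0
    for p in range(2, math.isqrt(x) + 1):
        if x % p == 0:
            while x % p == 0:
                x //= p
            return math.log(p) if x == 1 else 0.0
    return math.log(x)

def level(x):
    """Digit sum of x, i.e. the index k of the level set Omega_k containing x."""
    return bin(x).count("1")

def symmetrize(values):
    """Replace values[x] by the average of values over the level set of x."""
    n = (len(values) - 1).bit_length()
    sums, sizes = [0.0] * (n + 1), [0] * (n + 1)
    for x, v in enumerate(values):
        sums[level(x)] += v
        sizes[level(x)] += 1
    return [sums[level(x)] / sizes[level(x)] for x in range(len(values))]

def inner(g, h):
    """Unnormalized inner product, as in <Lambda, f> = sum of Lambda(x) f(x)."""
    return sum(a * b for a, b in zip(g, h))

if __name__ == "__main__":
    n = 11
    lam = [von_mangoldt(x) for x in range(2 ** n)]
    f = [1.0 if 2 * level(x) > n else 0.0 for x in range(2 ** n)]
    lam_s = symmetrize(lam)
    print(inner(lam, f), inner(lam_s, f))          # equal up to rounding
    print(inner(lam, lam), inner(lam_s, lam_s))    # the second is smaller
```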

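The advantage of introducing $\Lambda_s$ is a reduction of the $L^2$-norm. For
$0 < \rho < 1$, denote $T_\rho$ the usual convolution operator defined by

$$T_\rho w_S = \rho^{|S|} w_S$$

and which is a contraction on all $L^p$-spaces. Write

$$\langle \Lambda, f \rangle = \langle T_\rho \Lambda, f \rangle + \langle (1 - T_\rho) \Lambda_s, f \rangle = (2.2) + (2.3).$$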
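
As an illustration of the operator $T_\rho$, the sketch below assumes that it acts on the Walsh system by $w_S \mapsto \rho^{|S|} w_S$ (the form in which it enters (2.4)) and applies it through a fast Walsh-Hadamard transform. The function names are hypothetical.

```python
def walsh_hadamard(values):
    """Unnormalized fast Walsh-Hadamard transform of a list of length 2**n.

    Entry S of the result is the sum over x of values[x] * w_S(x), where the
    subset S is encoded as a bit mask and w_S(x) = (-1)**popcount(x & S).
    """
    a = list(values)
    h = 1
    while h < len(a):
        for start in range(0, len(a), 2 * h):
            for i in range(start, start + h):
                a[i], a[i + h] = a[i] + a[i + h], a[i] - a[i + h]
        h *= 2
    return a

def noise_operator(values, rho):
    """Apply T_rho (assumed: w_S -> rho**|S| w_S) to a function on {0,1}^n."""
    N = len(values)
    coeffs = walsh_hadamard(values)                      # N times the Walsh coefficients
    damped = [c * rho ** bin(S).count("1") for S, c in enumerate(coeffs)]
    return [v / N for v in walsh_hadamard(damped)]       # inverse transform

if __name__ == "__main__":
    g = [float((3 * x * x + 1) % 7) for x in range(16)]
    for rho in (1.0, 0.9, 0.5, 0.0):
        out = noise_operator(g, rho)
        print(rho, max(abs(v) for v in out), sum(v * v for v in out))
```

The printed sup-norm and squared $L^2$-norm do not exceed those of the input, in line with the contraction property.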

Then

$$(2.2) = \frac12 \sum_{x=1}^{N} \Lambda(x) + N \sum_{0<|S|\le n} \rho^{|S|} \hat\Lambda(S) \hat f(S) \qquad (2.4)$$

and estimate, recalling (1.9), the second term of (2.4) by

0(4) + ^ £ A- 3/1 [£|A(s)|f 

v n <k<n \S\=k 

<0 (JL)+^V~ <0 (JL) (2 .5) 

provided, cf fll.lOp . we set 

p=l-n'^ +2e . (2.6) 
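To estimate (2.3), we decompose further

$$\Lambda_s = \Lambda_s' + \Lambda_s'' \qquad (2.7)$$

denoting

$$\Lambda_s' = \sum_{|k - \frac n2| \le A \sqrt{n}} \frac{\sum_{x \in \Omega_k} \Lambda(x)}{|\Omega_k|}\, 1_{\Omega_k}$$

with $A > 1$ a parameter. Then

$$|\langle (1 - T_\rho) \Lambda_s, f \rangle| \le |\langle (1 - T_\rho) \Lambda_s', f \rangle| + \|\Lambda_s''\|_1.$$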

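Estimate

$$|\langle (1 - T_\rho) \Lambda_s', f \rangle| \le \|\Lambda_s'\|_2\, \|(1 - T_\rho) f\|_2$$

where

$$\|\Lambda_s'\|_2 \lesssim \sqrt{N} \Big\{ \max_{|k - \frac n2| \le A\sqrt{n}} \frac{\sum_{x \in \Omega_k} \Lambda(x)}{|\Omega_k|} \Big\}^{1/2} \le n^{\frac14} e^{CA^2} \Big\{ \max_{|k - \frac n2| \le A\sqrt{n}} \sum_{x \in \Omega_k} \Lambda(x) \Big\}^{1/2} \qquad (2.9)$$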

and, again from (1.3), (2.6),

H(i - r p )/|| a < VAffx^C 1 - /» fc ) 2 *- B/a " 



<v/iv[n - i/2 +x;^ i/2 (i-p) 2 " 



fc<no 



2 < n~l +2£ 



Hence 



" T P )A' S , f}\ < n^e^{ max [I £ A(x) 



'iV. 



Next

$$\|\Lambda_s''\|_1 = \sum_{|k - \frac n2| > A\sqrt{n}}\ \sum_{x \in \Omega_k} \Lambda(x).$$

Let $R \in \mathbb{Z}_+$, $R < \log n$ and estimate, again using the correlation
estimates of $\Lambda$ with low order Walsh functions



V in ^ |2B A 1^ |2A 



1 1 (2.12) 

< (CR) R n R N + {CR) R ( \M S )\) ~ (CR) R n R N. 

o<\S\<2R 

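Therefore

$$\sum_{|k - \frac n2| > A\sqrt{n}}\ \sum_{x \in \Omega_k} \Lambda(x) < e^{-cA^2} N. \qquad (2.13)$$

It remains to establish a bound on

$$\sum_{x \in \Omega_k} \Lambda(x) \qquad (2.14)$$

for $|k - \frac n2| < A\sqrt{n}$ in (2.11).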



3. Distribution of the sum of the digits of the primes 
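Our remaining task is to bound (2.14) in the range $k = \frac n2 + O(\sqrt{n})$.

Take a bump function $\eta$ on $\mathbb{R}$ s.t. $\hat\eta \ge 0$, $\eta(0) = 1$ and supp $\eta \subset [-\frac12, \frac12]$,
say.
Clearly

$$\sum_{x \in \Omega_k} \Lambda(x) = \int \Big[ \sum_{x=1}^{N} \Lambda(x)\, e^{i\lambda (\sum_j x_j - k)} \Big] \hat\eta(\lambda)\, d\lambda \qquad (3.1)$$

and we evaluate

$$\sum_{x=1}^{N} \Lambda(x)\, U_\lambda(x) \qquad (3.2)$$

where

$$U_\lambda(x) = e^{i\lambda \sum_{j=0}^{n-1} x_j}. \qquad (3.3)$$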

This issue is very similar to the case of the Morse function ($\lambda = \pi$)
considered by Mauduit-Rivat in [M-R]. Thus we will use the Vinogradov
type I-II sum approach from [M-R]. In what follows, we will in fact
rely on the presentation in [B1] (where the Moebius function rather
than $\Lambda$ is considered, but there is no essential difference here between
these cases).

The Fourier coefficients of $U_\lambda$ obey an estimate

$$|\hat U_\lambda(k)| < e^{-c\lambda^2 n}. \qquad (3.4)$$

The argument is similar to Lemma 2 in [B1]. In case of the Morse
sequence $w_{\{0,1,\ldots,n-1\}} = U_\pi$, one has in particular $\|\hat U_\pi\|_\infty < e^{-cn}$, which
is stronger than (3.4) for small $\lambda$. This is the most significant difference
compared with [B1].
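
The product structure of $U_\lambda$ makes its Fourier coefficients explicit. The sketch below assumes the normalization $\hat U_\lambda(k) = N^{-1} \sum_{x<N} U_\lambda(x) e^{-2\pi i k x / N}$ (an assumption; the paper does not spell out its convention here) and compares a brute-force evaluation with the factorization over binary digits, whose $j$-th factor has modulus $|\cos\frac12(\lambda - 2\pi k 2^j / N)|$. The function names are hypothetical.

```python
import cmath
import math

def fourier_coefficient_bruteforce(n, lam, k):
    """Direct evaluation of the k-th Fourier coefficient of U_lambda on Z/2^n."""
    N = 2 ** n
    return sum(cmath.exp(1j * lam * bin(x).count("1") - 2j * math.pi * k * x / N)
               for x in range(N)) / N

def fourier_coefficient_product(n, lam, k):
    """Same coefficient via the factorization over the binary digits of x."""
    N = 2 ** n
    result = 1.0 + 0j
    for j in range(n):
        theta = lam - 2 * math.pi * k * 2 ** j / N
        result *= (1 + cmath.exp(1j * theta)) / 2   # digit x_j contributes (1 + e^{i theta}) / 2
    return result

if __name__ == "__main__":
    n, lam = 12, 0.3
    largest = max(abs(fourier_coefficient_product(n, lam, k)) for k in range(2 ** n))
    # Compare the largest coefficient with the Gaussian-type scale exp(-lam^2 n).
    print(largest, math.exp(-lam ** 2 * n))
```

For $k = 0$ the modulus is exactly $|\cos\frac\lambda2|^n$, which already shows the scale $e^{-c\lambda^2 n}$ appearing in (3.4); the sketch does not attempt to verify (3.4) for all $k$ with a specific constant.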

Recall some terminology. Let $n = m_1 + m_2$, $M_1 = 2^{m_1}$, $M_2 = 2^{m_2}$,
$m_1 \le m_2$.

Type-II sums are of the form

$$\sum_{x_1 \sim M_1,\ x_2 \sim M_2} a_{x_1} b_{x_2}\, U_\lambda(x_1 x_2) \qquad (3.5)$$

where $a_{x_1}$, $b_{x_2}$ are (arbitrary) bounded sequences (in fact obtained as
multiplicative convolutions of $\Lambda$ and $\mu$) and we may assume $M_1 > N^{c_0}$ for a fixed exponent $c_0 > 0$.
For the Type-I sums, we set $b_{x_2} = 1$. Of course, the analysis of Type-II
sums applies equally well to the Type-I sums, but for the latter, also
other considerations will be involved when $M_1$ is small.

We start by recalling the Type-II bound (2.31) from [B1], which in
view of (3.4) becomes

$$|(3.5)| < N \big( L^{-c_1} + L^2 M_1^{-c_2} + L^{c_3} M_1^{-c\lambda^2} \big) \qquad (3.6)$$

where $c_1, c_2, c_3$ are some constants, $L$ a parameter (note that [B1] treats
the case of an arbitrary Walsh function $w_S$, while for our purpose only
the case $S = \{0, 1, \ldots, n-1\}$ is of relevance).
Optimizing (3.6) in $L$ gives a bound of the form

$$N M_1^{-c'\lambda^2}. \qquad (3.7)$$

Next, according to [B1], (3.2') and (3.4), the following estimate on
Type-I sums is gotten

$$M_1^2 M_2 \|\hat U_\lambda\|_\infty < N M_1 e^{-c\lambda^2 n}. \qquad (3.8)$$

Assuming $M_1 > N^{c_0}$ for a fixed exponent $c_0 > 0$, (3.7) gives a bound $N e^{-c\lambda^2 n}$ on Type-II sums.
The Type-I sums may be estimated using either (3.7) or (3.8), hence
satisfy a bound $N e^{-c\lambda^4 n}$, which is conclusive provided

$$\lambda > n^{-\frac14 + \varepsilon}. \qquad (3.9)$$

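The range (3.9) is not quite sufficient for our needs. Consequently
assume

$$n^{-\frac12 + \varepsilon} < \lambda < n^{-\frac14 + \varepsilon} \qquad (3.10)$$

and in view of the already available estimates (3.7), (3.8), also

$$c\lambda^2 n < m_1 < n^{\varepsilon} \lambda^{-2}. \qquad (3.11)$$

Take

$$m_1 \ll m \ll n \qquad (3.12)$$

to specify and decompose

$$x = (y, z) \in \{0,1\}^m \times \{0,1\}^{n-m}.$$

Write

$$U_\lambda(x) = e^{i\lambda \sum_{j<m} y_j}\, e^{i\lambda \sum_j z_j} = U(y) V(z).$$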
Hence 

£ | £ C/a(^ 2 )|<^ £ I ]T (3.13) 

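Some further manipulation of $U(y)$ is needed. Write $\varepsilon_j = 1 - 2y_j$
and

$$U(y) = e^{i\lambda \frac m2} \Big( \cos\frac\lambda2 \Big)^m \prod_{j=1}^{m} \Big( 1 - i\varepsilon_j \operatorname{tg}\frac\lambda2 \Big). \qquad (3.14)$$

Expanding the last factor of (3.14) in the Walsh system

$$\prod_{j=1}^{m} \Big( 1 - i\varepsilon_j \operatorname{tg}\frac\lambda2 \Big) = \sum_{k \le k_1} \Big( -i \operatorname{tg}\frac\lambda2 \Big)^k \sum_{|S|=k} w_S(\varepsilon) + \sum_{k_1 < k \le m} \cdots$$

$$= (3.15) + (3.16).$$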

Taking

$$k_1 \sim \lambda^2 m \qquad (3.17)$$

gives

$$\|(3.16)\|_2 \le \Big[ \sum_{m \ge k > k_1} \binom{m}{k} |\lambda|^{2k} \Big]^{1/2} < 1$$

and the contribution of (3.16) in (3.13) is bounded by

$$\Big| \cos\frac\lambda2 \Big|^m 2^m 2^{n-m} < e^{-c\lambda^2 m} N. \qquad (3.18)$$
Next, in (3.15), expand 

h = E Ws 

\S\=k 

in a regular Fourier series 

2 m -l 

%)= ^Mr)eg). (3.19) 

r=0 

Fixing $0 \le r < 2^m$, substituting in (3.13), we obtain



E j E e S 

x 1 ~Afi y=— xjmodi 1 ) 
2»n 



< 



2 r, 



(3.20) 



Let $\delta > 0$ be another parameter and assume that



2m- || 2 m J 

By the pigeonhole principle, there is some q' < | s.t. |||^|| < 
and therefore we get 

— = - + 9 (3.22) 

with 

?<^(o,g) = l and |0| < (3.23) 

Assuming 

5 > 2"T (3.24) 
it follows that ||fsr|| > || — 1| — 2m \ ni > f, unless x 1 a = O(modg). If
x 1 a = O(modg), Hf^H = x 1 !^! and we obtain the condition 



, nMi 

x < 



2 m \e\ 

In view of (3.25), this implies that 



W £ a^- (3-25) 

Let satisfy (3.22) with 

l 77 

9 < * and ^ ~ 2^- (3 - 25) 
We estimate ws(r). Thus, letting tp = ^ 

(a;o,...,3: m _i)G{0,l} m 

= 2" m + e 2 ™ 2 ^) - e 2 ™ 2 ^) 

m j&s 

and 

|tf>s(r)| = J j | cos7r2V| j 1 1 sin 7r2 J (,c | 
its j&s 

< I ] ( I cos 2^2 J - 1 + n2 j \6 |) J ] ( I sin tt2 j - + n2 j \9\ ) 

j<m—J j<m—J 

(3.27) 

with $1 < J \ll m$ to specify.

By (3.26), 2i\6\ < < |.2- J and we take 

$$J \sim \log\frac1\delta + k \qquad (3.28)$$

so as to ensure that

\w s (r)\< Yl |cos2tt2 j -| J[ | sin tt2 j - + 5. (3.29) 
US q jes q 

j<m-.J j<m-J 
Recall that $|S| = k \le k_1 \sim \lambda^2 m$. It follows that there is an interval
$I = \{j_0, \ldots, j_1 - 1\}$ in $\{0, \ldots, [\frac m2]\}$ of size

$$j_1 - j_0 \sim \frac{m}{2k_1} \qquad (3.30)$$

which is disjoint from $S$. The first factor in (3.29) is then majorized by
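
The existence of the interval in (3.30) is a pigeonhole statement: at most $k$ points of $S$ cut $\{0, \ldots, [\frac m2]\}$ into at most $k+1$ runs, so some run has length at least $([\frac m2] + 1 - k)/(k+1)$. A small sketch, with a hypothetical helper name, finds such a run:

```python
def longest_gap(S, upper):
    """Longest run of consecutive integers in {0, ..., upper} avoiding S.

    Returns (start, length) of the first longest run; length is 0 if no point is free.
    """
    blocked = set(S)
    best_start, best_len = 0, 0
    run_start = None
    for j in range(upper + 2):                  # the extra step j = upper + 1 closes the last run
        free = j <= upper and j not in blocked
        if free and run_start is None:
            run_start = j
        elif not free and run_start is not None:
            if j - run_start > best_len:
                best_start, best_len = run_start, j - run_start
            run_start = None
    return best_start, best_len

if __name__ == "__main__":
    m, S = 40, {0, 7, 13, 19, 31}
    j0, length = longest_gap(S, m // 2)
    print(j0, j0 + length)   # the interval {j0, ..., j1 - 1} with j1 = j0 + length
```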

2h -oo -l 



n 

j6/ 



• a 

cos 2tx2 j — 



1 



-jo 



< 



Q 



< 



£ ■ 

1 



(3.31) 



2^° 52 W 

provided $q$ is not a power of 2. On the other hand, if $q$ is a power of 2,
then $\sin \pi 2^j \frac{a}{q} = 0$ for $j \ge \log_2 q$ and we conclude that



\w s {r)\ <5+ -K^ <5+ -e~ cX 2 
§2 2fc i o 



(3.32) 



except if $S \subset \{0, 1, \ldots, J\} \cup \{m - J, \ldots, m - 1\}$.

Consequently, the contribution of the $k$-term of (3.15) in (3.13) may
be estimated as follows



2"( cos- 



A * 
y 2 



i<*+ 



m 

A: 



5 + 

o 



2J 

+ | ) max||iu s ||i 
A; / |5|=fc 



}■ 

(3.33) 



with $J$ given by (3.28).

Making a suitable approximation of the step-function by Fourier-
truncation (cf. [B1] for details), with an $L^1$-error at most $m^{-k}$ say, we
ensure that

$$\|\hat w_S\|_1 < (ck \log n)^k \qquad (3.34)$$

and hence



Nil < 



m 
k 



(ck log n) 



(3.35) 



Substituting (3.34), (3.35) in (3.33), we find 



clogT 



(3.33) < 2 n e- cym \m 2k 5 + m k 5- 1 e- cX ~ 2 + ( 1 + f (cA;Alogn) fc }. 



k 



(3.36) 
Taking $\delta = m^{-2k_1}$ gives

$$(3.33) < 2^n e^{-c\lambda^2 m} \big( 1 + m^{3k_1} e^{-c\lambda^{-2}} + (C k_1 (\log n)^2 \lambda)^{k_1} \big)$$

$$< 2^n e^{-c\lambda^2 m} \big( 1 + e^{C (\log n) \lambda^2 m - c\lambda^{-2}} + (C (\log n)^2 \lambda^3 m)^{k_1} \big). \qquad (3.37)$$

Recalling (3.10)-(3.12), take

$$m = \frac{c}{(\log n)^2} \min(\lambda^{-3}, n). \qquad (3.38)$$

Then

$$(3.37) < 2^n e^{-c\lambda^2 m} < 2^n e^{-c (\log n)^{-2} \min(\lambda^{-1},\, \lambda^2 n)} \qquad (3.39)$$

which gives a bound for the (3.15)-contribution to (3.13).

Thus we proved that if $n^{-\frac12 + \varepsilon} < \lambda < n^{-\frac14 + \varepsilon}$ and $\lambda^2 n < m_1 < n^{\varepsilon} \lambda^{-2}$,
then

( jSISD < Ne~ n \ (3.40) 

Summarizing, we conclude that (13. 2p may certainly be bounded by — 
provided $\lambda > n^{-\frac12 + \varepsilon}$.

Consequently, substituting in (3.1) gives

$$\sum_{x \in \Omega_k} \Lambda(x) < N n^{-\frac12 + \varepsilon}. \qquad (3.41)$$



4. Conclusion of the proof of Theorem 1 
Substitution of (3.41) in (2.9) gives 

||A'J 2 < e CA2 n e N (4.1) 

and in (2.11) 

\((l-T p )A' s ,f)\<e CA2 n-^N. (4.2) 
Recalling (2.5) and (2.13), we proved that 

N 



(A, /) < o(4=) + e CA2 n-^ s N + e'^ N 
< n~ c N 



for some constant c > 0, by suitable choice of A. 
(4.3)



Hence, Theorem 1 holds in the more precise form

$$\sum_{x=1}^{N} \Lambda(x) f(x) = \frac{N}{2} + O(n^{-c} N). \qquad (4.4)$$